Physical Representation Learning and Parameter Identification from Video Using Differentiable Physics
نویسندگان
چکیده
Abstract Representation learning for video is increasingly gaining attention in the field of computer vision. For instance, prediction models enable activity and scene forecasting or vision-based planning control. In this article, we investigate combination differentiable physics spatial transformers a deep action conditional representation network. By our model learns physically interpretable latent can identify physical parameters. We propose supervised self-supervised methods architecture. experiments, consider simulated scenarios with pushing, sliding colliding objects, which also analyze observability properties. demonstrate that network learn to encode images properties like mass friction from videos sequences. evaluate accuracy training methods, ability method predict future frames input actions.
منابع مشابه
Video Representation Learning Using Discriminative Pooling
Popular deep models for action recognition in videos generate independent predictions for short clips, which are then pooled heuristically to assign an action label to the full video segment. As not all frames may characterize the underlying action—indeed, many are common across multiple actions—pooling schemes that impose equal importance on all frames might be unfavorable. In an attempt to ta...
متن کاملLearning Compact Appearance Representation for Video-based Person Re-Identification
This paper presents a novel approach for video-based person re-identification using multiple Convolutional Neural Networks (CNNs). Unlike previous work, we intend to extract a compact yet discriminative appearance representation from several frames rather than the whole sequence. Specifically, given a video, the representative frames are selected based on the walking profile of consecutive fram...
متن کاملA Differentiable Physics Engine for Deep Learning in Robotics
One of the most important fields in robotics is the optimization of controllers. Currently, robots are often treated as a black box in this optimization process, which is the reason why derivative-free optimization methods such as evolutionary algorithms or reinforcement learning are omnipresent. When gradient-based methods are used, models are kept small or rely on finite difference approximat...
متن کاملLearning Physics with Video Analysis
Inspired by the pioneering work in photographic studies of motion and motion picture projection by Eadweard Muybridge in 1878 and by the high speed films of Harold Edgerton by the middle of the 20th century, the use of video has rapidly emerged nowadays as a powerful tool to teach physics at schools and universities, capturing what the human eye could not distinguish. To highlight the advantage...
متن کاملthe relationship between using language learning strategies, learners’ optimism, educational status, duration of learning and demotivation
with the growth of more humanistic approaches towards teaching foreign languages, more emphasis has been put on learners’ feelings, emotions and individual differences. one of the issues in teaching and learning english as a foreign language is demotivation. the purpose of this study was to investigate the relationship between the components of language learning strategies, optimism, duration o...
15 صفحه اولذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Computer Vision
سال: 2021
ISSN: ['0920-5691', '1573-1405']
DOI: https://doi.org/10.1007/s11263-021-01493-5